SHED: Shannon Entropy Descriptors from Topological Feature Distributions
نویسندگان
چکیده
A novel set of molecular descriptors called SHED (SHannon Entropy Descriptors) is presented. They are derived from distributions of atom-centered feature pairs extracted directly from the topology of molecules. The value of a SHED is then obtained by applying the information-theoretical concept of Shannon entropy to quantify the variability in a feature-pair distribution. The collection of SHED values reflecting the overall distribution of pharmacophoric features in a molecule constitutes its SHED profile. Similarity between pairs of molecules is then assessed by calculating the Euclidean distance of their SHED profiles. Under the assumption that molecules having similar pharmacological profiles should contain similar features distributed in a similar manner, examples are given to show the ability of SHED for scaffold hopping in virtual chemical screening and pharmacological profiling compared to that of substructural BCI fingerprints and three-dimensional GRIND descriptors.
منابع مشابه
Shannon entropy in generalized order statistics from Pareto-type distributions
In this paper, we derive the exact analytical expressions for the Shannon entropy of generalized orderstatistics from Pareto-type and related distributions.
متن کاملImplementing the Fisher's Discriminant Ratio in a k-Means Clustering Algorithm for Feature Selection and Data Set Trimming
The Fisher's discriminant ratio has been used as a class separability criterion and implemented in a k-means clustering algorithm for performing simultaneous feature selection and data set trimming on a set of 221 HIV-1 protease inhibitors. The total number of molecular descriptors computed for each inhibitor is 43, and they are scaled to lie between 1 and 0 before being subjected to the featur...
متن کاملFeature Selection for Descriptor Based Classification Models. 2. Human Intestinal Absorption (HIA)
We show that the topological polar surface area (TPSA) descriptor and the radial distribution function (RDF) applied to electronic and steric atom properties, like the conjugated electrotopological state (CETS), are the most relevant features/descriptors for predicting the human intestinal absorption (HIA) out of a large set of 2934 features/descriptors. A HIA data set with 196 molecules with m...
متن کاملRED: A Set of Molecular Descriptors Based on Re'nyi Entropy
New molecular descriptors, RED (Renyi entropy descriptors), based on the generalized entropies introduced by Renyi are presented. Topological descriptors based on molecular features have proven to be useful for describing molecular profiles. Renyi entropy is used as a variability measure to contract a feature-pair distribution composing the descriptor vector. The performance of RED descriptors ...
متن کاملQSPR study on benzene derivatives to some physico-chemical properties by using topological indices
QSPR study on benzene derivatives have been made using recently introduced topological methodology. In this study the relationship between the Randic' (x'), Balaban (J), Szeged (Sz),Harary (H), Wiener (W), HyperWiener and Wiener Polarity (WP) to the thermal energy (Eth), heat capacity (CV) and entropy (S) of benzene derivatives is represented. Physicochemical properties are taken from the quant...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of chemical information and modeling
دوره 46 4 شماره
صفحات -
تاریخ انتشار 2006